Speech Enhancement Using Temporal Masking in the FFT Domain

نویسندگان

Yao Wang

Jiong An

Teddy Surya Gunawan

Eliathamby Ambikairajah

چکیده

Temporal masking models have not been previously applied in the Fast Fourier Transform (FFT) domain for speech enhancement applications. This paper presents a novel speech enhancement algorithm using temporal masking in the FFT domain. The proposed algorithm is suitable for the cochlear speech processor and for other speech applications. The input signal is analysed using FFT and then grouped into 22 critical bands. The noise power is estimated using a minimum statistics noise tracking algorithm. A short-term temporal masking threshold is then calculated for each critical band and a gain factor for each band is then computed. The objective and subjective evaluations show that the temporal masking model based speech enhancement scheme outperforms the traditional Wiener filtering approach in the FFT domain.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Integrated speech enhancement and coding in the time-frequency domain

This paper addresses the problem of merging speech enhancement and coding in the context of an auditory modeling. The noisy signal is rst processed by a fast wavelet packet transform algorithm to obtain an auditory spectrum, from which a rough masking model is estimated. Then, this model is used to re ne a subtractive-type enhancement algorithm. The enhanced speech coe cients are then encoded i...

متن کامل

Speech Enhancement using Temporal Masking and Fractional Bark Gammatone Filters

A speech enhancement technique based on the temporal masking properties of the human auditory system is presented. The noisy signal is divided into a number of sub-bands with fractional bark accuracy, and the sub-band signals are individually and adaptively weighted in the time domain according to a short-term temporal masking threshold to noise ratio estimate in each subband. Objective measure...

متن کامل

Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain

This article presents a new feature extraction technique based on the temporal tracking of clusters in spectro-temporal features space. In the proposed method, auditory cortical outputs were clustered. The attributes of speech clusters were extracted as secondary features. However, the shape and position of speech clusters change during the time. The clusters temporally tracked and temporal tra...

متن کامل

Single channel speech enhancement by frequency domain constrained optimization and temporal masking

A speech enhancement algorithm is proposed that exploits the masking properties of the human auditory system. The enhancement is formulated as a frequency domain constrained optimization problem. The noise components of the noisy speech are suppressed by a gain function subject to the constraint that both the signal distortion and residual noise should fall below the masking thresholds. Tempora...

متن کامل

Perceptual speech enhancement exploiting temporal masking properties of human auditory system

The use of simultaneous masking in speech enhancement has shown promise for a range of noise types. In this paper, a new speech enhancement algorithm based on a short-term temporal masking threshold to noise ratio (MNR) is presented. A novel functional model for forward masking based on three parameters is incorporated into a speech enhancement framework based on speech boosting. The performanc...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2006

Speech Enhancement Using Temporal Masking in the FFT Domain

نویسندگان

چکیده

منابع مشابه

Integrated speech enhancement and coding in the time-frequency domain

Speech Enhancement using Temporal Masking and Fractional Bark Gammatone Filters

Phoneme Classification Using Temporal Tracking of Speech Clusters in Spectro-temporal Domain

Single channel speech enhancement by frequency domain constrained optimization and temporal masking

Perceptual speech enhancement exploiting temporal masking properties of human auditory system

عنوان ژورنال:

اشتراک گذاری